Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Mathematical knowledge browser with automatic hyperlink detection

Identifieur interne : 001161 ( Main/Exploration ); précédent : 001160; suivant : 001162

Mathematical knowledge browser with automatic hyperlink detection

Auteurs : Koji Nakagawa [Japon] ; Masakazu Suzuki (mathématicien) [Japon]

Source :

RBID : Pascal:08-0032035

Descripteurs français

English descriptors

Abstract

Mathematical OCR (Optical Character Recognition) systems retrieve character sequences and the structure of mathematical formulae from raster images scanned from mathematical documents. In this paper a method for detecting hyperlinks, e.g. formula links, from mathematical OCR output is described. We also experimentally demonstrated the effectiveness of the method. By using the method we implemented a prototype system of a mathematical knowledge browser that helps people read mathematical articles.


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">Mathematical knowledge browser with automatic hyperlink detection</title>
<author>
<name sortKey="Nakagawa, Koji" sort="Nakagawa, Koji" uniqKey="Nakagawa K" first="Koji" last="Nakagawa">Koji Nakagawa</name>
<affiliation wicri:level="4">
<inist:fA14 i1="01">
<s1>Faculty of Mathematics, Kyushu University, Kyushu Univ. 36</s1>
<s2>Fukuoka 812-8581</s2>
<s3>JPN</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<wicri:noRegion>Fukuoka 812-8581</wicri:noRegion>
<orgName type="university">Université de Kyūshū</orgName>
<placeName>
<settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Suzuki, Masakazu" sort="Suzuki, Masakazu" uniqKey="Suzuki M" first="Masakazu" last="Suzuki">Masakazu Suzuki (mathématicien)</name>
<affiliation wicri:level="4">
<inist:fA14 i1="01">
<s1>Faculty of Mathematics, Kyushu University, Kyushu Univ. 36</s1>
<s2>Fukuoka 812-8581</s2>
<s3>JPN</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<wicri:noRegion>Fukuoka 812-8581</wicri:noRegion>
<orgName type="university">Université de Kyūshū</orgName>
<placeName>
<settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<placeName>
<settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university" n="3">Université de Kyūshū</orgName>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">08-0032035</idno>
<date when="2006">2006</date>
<idno type="stanalyst">PASCAL 08-0032035 INIST</idno>
<idno type="RBID">Pascal:08-0032035</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000298</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000486</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000329</idno>
<idno type="wicri:Area/Main/Merge">001190</idno>
<idno type="wicri:Area/Main/Curation">001161</idno>
<idno type="wicri:Area/Main/Exploration">001161</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a">Mathematical knowledge browser with automatic hyperlink detection</title>
<author>
<name sortKey="Nakagawa, Koji" sort="Nakagawa, Koji" uniqKey="Nakagawa K" first="Koji" last="Nakagawa">Koji Nakagawa</name>
<affiliation wicri:level="4">
<inist:fA14 i1="01">
<s1>Faculty of Mathematics, Kyushu University, Kyushu Univ. 36</s1>
<s2>Fukuoka 812-8581</s2>
<s3>JPN</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<wicri:noRegion>Fukuoka 812-8581</wicri:noRegion>
<orgName type="university">Université de Kyūshū</orgName>
<placeName>
<settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Suzuki, Masakazu" sort="Suzuki, Masakazu" uniqKey="Suzuki M" first="Masakazu" last="Suzuki">Masakazu Suzuki (mathématicien)</name>
<affiliation wicri:level="4">
<inist:fA14 i1="01">
<s1>Faculty of Mathematics, Kyushu University, Kyushu Univ. 36</s1>
<s2>Fukuoka 812-8581</s2>
<s3>JPN</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<wicri:noRegion>Fukuoka 812-8581</wicri:noRegion>
<orgName type="university">Université de Kyūshū</orgName>
<placeName>
<settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<placeName>
<settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university" n="3">Université de Kyūshū</orgName>
</affiliation>
</author>
</analytic>
<series>
<title level="j" type="main">Lecture notes in computer science</title>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<title level="j" type="main">Lecture notes in computer science</title>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Automatic measurement</term>
<term>Character recognition</term>
<term>Hyperlink</term>
<term>Information browsing</term>
<term>Knowledge engineering</term>
<term>Mathematical formula</term>
<term>Mathematics</term>
<term>Optical character recognition</term>
<term>Raster</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Ingénierie connaissances</term>
<term>Mathématiques</term>
<term>Navigation information</term>
<term>Mesure automatique</term>
<term>Reconnaissance caractère</term>
<term>Reconnaissance optique caractère</term>
<term>Lien hypertexte</term>
<term>Formule mathématique</term>
<term>Trame</term>
</keywords>
<keywords scheme="Wicri" type="topic" xml:lang="fr">
<term>Mathématiques</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Mathematical OCR (Optical Character Recognition) systems retrieve character sequences and the structure of mathematical formulae from raster images scanned from mathematical documents. In this paper a method for detecting hyperlinks, e.g. formula links, from mathematical OCR output is described. We also experimentally demonstrated the effectiveness of the method. By using the method we implemented a prototype system of a mathematical knowledge browser that helps people read mathematical articles.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Japon</li>
</country>
<region>
<li>Kyūshū</li>
<li>Préfecture de Fukuoka</li>
</region>
<settlement>
<li>Fukuoka</li>
</settlement>
<orgName>
<li>Université de Kyūshū</li>
</orgName>
</list>
<tree>
<country name="Japon">
<region name="Kyūshū">
<name sortKey="Nakagawa, Koji" sort="Nakagawa, Koji" uniqKey="Nakagawa K" first="Koji" last="Nakagawa">Koji Nakagawa</name>
</region>
<name sortKey="Suzuki, Masakazu" sort="Suzuki, Masakazu" uniqKey="Suzuki M" first="Masakazu" last="Suzuki">Masakazu Suzuki (mathématicien)</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001161 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001161 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     Pascal:08-0032035
   |texte=   Mathematical knowledge browser with automatic hyperlink detection
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024